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Abstract 

Improving a result of Dyer, Frieze and Greenhill [Journal of Combinatorial Theory, Series B, 2015], we deter¬ 
mine the ^y-colorability threshold in random fc-uniform hypergraphs up to an additive error of ln2 -)- e^, where 
limq^cxD Eq - 0. The new lower bound on the threshold matches the “condensation phase transition” predicted 
by statistical physics considerations [Krzakala et al., PNAS 2007]. 

Mathematics Subject Classification: 05C80 (primary), 05C15 (secondary) 


1 Introduction 

Recent work on random constraint satisfaction problems has focused either on the case of binary variables and 
fc-ary constraints (e.g., random fc-SAT) or on the case of fc-ary variables and binary constraints (e.g., random graph 
coloring) for some fc > 3. In these two cases substantial progress has been made over the past few years. For 
instance, the fc-SAT threshold has been identified precisely for large enough fc [12]. Moreover, in the random hy¬ 
pergraph 2-coloring problem (or equivalently the fc-NAESAT problem) the threshold is known up to an error term 
that tends to 0 rapidly in terms of the size fc of the edges [11]. In addition, the best current upper and lower bounds 
on the fc-colorability threshold of the Erdos-Renyi random graph are within a small additive constant [9]. By com¬ 
parison, little is known about problems in which both the arity of the constraints and the domain of the variables 
have size greater than two. Although it has been asserted tbat tbe techniques developed in recent work should 
carry over [9], this claim has hardly been put to the test. In fact, although the fc-XORSAT threshold (random linear 
equations with fc variables apiece over the field of size 2) has been known for a while [13, 28], the case of equations 
of length fc > 3 over fields of size greater than two largely remains elusive (apart from [16]). 

The present paper deals with one of the most natural examples of a problem with fc-ary constraints and q-ary 
variables with q, fc > 3, namely q-colorability of random fc-uniform hypergraphs. Let [m] denote the set [1,..., m] 
for any positive integer m. To be precise, by a q-coloring of G = (V,E) we mean a map a -.V ^ [q] such that 
|cr(e)| > 1 for all e e E, i.e., no edge is monochromatic. The chromatic number of G is the least q for which a 

*The research leading to these results has received funding from the European Research Council under the European Union’s Seventh Frame¬ 
work Programme {FP/2007-2013) / ERC Grant Agreement n. 278857-PTCC. 

^Research supported by the Australian Research Council Discovery Project DP140101519. 
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t 7 -coloring exists. The random hypergraph model that we consider is the most natural one, i.e., e ‘^{n, k, m) 
is a (simple) fc-uniform hypergraph on the vertex set [n] := {1,2,3,..., n} with a set of precisely m edges chosen 
uniformly at random. 

For every q > 2,k > 3 there exists a (non-uniform) sharp threshold Cq^k = Cq^^in) for ( 7 -colorahility [19]. That 
is, if m = m{n) is a sequence such that for some fixed e > 0 we have m{n) < (1 - £)ncq^kin), then '^{n, k, m) is q- 
colorahle w.h.p., whereas w.h.p. the random hypergraph fails to he ^ 7 -colorahle if m[n) > (1 -i- e)ncq^kin). The hest 
prior hounds on this threshold, obtained hy Dyer, Frieze and Greenhill [14, Remark2.1, (82)], read 

{q^~^ - 1 )In <7 - 1 - Eq j^ < liminf Cq i^{n) < limsup Cq f^{ri) < iq^~^ - 1 / 2 ] ln^ 7 , ( 1 . 1 ) 

where limj^^co Eq,k - 0 for any fixed k>3. Thus, the upper and the lower hound differ hy an additive ^ In < 7 - 1 - l+Eq i^, 
a term that diverges in the limit of large q. The main result of this paper provides an improved lower hound that is 
within an additive ln 2 of the upper hound from ( 1 . 1 ), in the large -(7 limit. 

Theorem 1.1. For each fc > 3 there is a number qo = qoik) > 0 such that for all q> qo we have 

liminf Cqk{n}> {q^~^ - \l2)\nq - \n2 - X.QWnqlq. 

The proof of Theorem 1.1 is based on the second moment method. So is [14], which generalises the second 
moment argument of Achlioptas and Naor [4] from graphs to hypergraphs. The result of Achlioptas and Naor 
was recently improved by Coja-Oghlan and Vilenchik [9], and in this paper we generalise the argument from that 
paper to hypergraphs. AA/hile numerous details need adjusting, the basic proof strategy that we pursue is similar 
to the one suggested in [9]. In particular, the improvement over [14] results from studying the second moment of 
a subtly chosen random variable. While the random variable considered in [14] is just the number of (balanced) 
^ 7 -colorings of the random hypergraph, here we use a random variable that is inspired by ideas from statistical 
mechanics; we will give a more detailed outline in Section 3 below. Thus, the present paper shows that, indeed, 
with a fair number of careful modifications the method from [9] can be generalised to hypergraphs. 


Notation. We assume throughout that the number of vertices, n, is sufficiently large for our estimates to hold. We 
also assume that the number of colors, q, exceeds some large enough constant qo = qolk). But of course q, k are 
always assumed to remained fixed as 00 . 

We use the 0-notation to refer to the limit n ^ 00 . For example, f{n) - 0(g(n]] means that there exists some 
C > 0, no > 0 such that for all n> no we have l/(n)| < C-|g(n)|. In addition, o(-),Q(-),0(-) take their usual definitions, 
except that we assume the expression 0(n) is positive (for sufficiently large n] whenever we write exp{-Fl{n)). We 
write/(n) ~ g{n) if lim„_oo/(«)/g(«) = 1- 

When discussing estimates that hold in the limit of large q we will make this explicit by adding the subscript q 
to the asymptotic notation. Therefore, f{q) = Oqigiq)) means that there exists positive constants C, qo such that 
for all q > qo we have |/(^7)| < C• |g(^7)|. Furthermore, we will write f{q) = Oqigiq)) to indicate that there exists 
positive C, qo such that for all q> qowe have \fiq)\ < Onq)^ ■ |g(<7)|. 


2 Related work 

The quest for the chromatic number of random graphs (i.e., ^in,2,m)) goes back to the seminal 1960 paper 
of Erdos and Renyi in which they established the “giant component” phase transition [15]. But it took almost 
thirty years until a celebrated paper of Bollobas [7] determined the asymptotic value of the chromatic number of 
dense random graphs. Elis proof used martingale tail bounds, which were introduced to combinatorics by Shamir 
and Spencer [29] to investigate the concentration of the chromatic number. Building upon ideas of Matula [27], 
Luczak [25] determined the asymptotic value of the chromatic number of the Erdos-Renyi random graph in the 
case that m - min) satisfies mln^ 00 . Flowever, the results from [7, 25] only determine the chromatic number 
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up to a multiplicative error of 1 + o(l) as n ^ oo, and the resulting error term exceeds the width within which 
the chromatic number is known to be concentrated. Indeed, in the case that m = m(n) < it is known 

that the chromatic number of the random graph is concentrated on two subsequent integers [6, 26]. In the sparse 
case m = 0[n) the precise values of these two integers are implied by the current bounds on the ^/-colorability 
threshold [4, 8, 9]. 

The 2-colorability problem in random hypergraphs, which is essentially equivalent to the random fc-NAESAT 
problem, has also been studied. Achlioptas and Moore [2, 3] showed that the 2-colorability threshold can be ap¬ 
proximated within a small additive constant via the second moment method. Furthermore, Coja-Oghlan and Zde- 
borova [10] established the existence of a further phase transition apart from the threshold for 2-colorability, the 
“condensation phase transition”. The name derives from an intriguing connection to the statistical mechanics of 
glasses [21, 23]. Moreover, the argument of Coja-Oghlan and Panagiotou [11] determines the 2-colorability thresh¬ 
old in fc-uniform random hypergraphs up to an additive error term Ck that tends to 0 exponentially as a function 
of k. 

Prior to the aforementioned work of Dyer, Frieze and Greenhill [14] the q-colorability problem in hypergraphs 
was studied by Krivelevich and Sudakov [22], who also considered other possible notions of colorings. Their results 
are of a similar nature to Luczak’s [25] in the case of graphs. That is, they determine the value of the chromatic 
number up to a multiplicative 1 -t o(l) factor, with o(l) hiding a term that vanishes as m/n ^ oo. The same is 
true of the results of Kupavskii and Shabanov [24], which partly improve upon [22]. Flowever, the bounds on the 
q-colorability threshold that can be read out of [22, 24] are less precise than those obtained in [14] (upon which 
Theorem 1.1 improves). 


3 Outline 

Throughout, we assume that n is sufficiently large for our error estimates to hold, and that q > qo- Further, we 
assume that m = \cn\ and for ease of notation will often write cn rather than \cn\. 


The second moment method. The second moment method has become the mainstay for lower-bounding satis¬ 
fiability thresholds [2, 5,17]. 

Suppose that we can construct a non-negative random variable Z on ^(n, k, cn) such that the event Z(G) > 0 
implies q-colorability, and such that 

E[Z2] = 0(E[Z]2) asn^oo. (3.1) 


Then the Paley-Zygmund inequality implies that 


E[Z]^ 

liminf P [Z > 0] > liminf > 0. 

n^oo n^oo E[Z^] 


(3.2) 


Combining (3.2) with the sharp threshold result from [19], which establishes the existence of a sharp threshold 
sequence Cq k yields liminf„^oo Cij.fc W S c. Flence, the second moment method can be summarised as follows. 

Fact 3.1. If there is a non-negative random variable Z On'S in, k,cn) such thatZ > 0 implies q-color ability and (3.1) 
holds, then liminfn^oo Cq^kin) > c. 


Thus, our task is to exhibit a random variable Z on'Sin, k, cn) that satisfies (3.1) for as large a value of c as possible. 


Balanced colorings. Certainly the most natural choice for Z seems to be the number Zq of q-colorings of the 
random hypergraph. Clearly, Zq >0 and Zq{G) > 0 only if G is q-colorable. However, technically Zq is a bit un¬ 
wieldy. Therefore, following Achlioptas and Naor [4], Dyer, Frieze and Greenhill [14] considered a slightly modified 
random variable. Namely, let us call a map cr: [n] ^ [q] balanced if |(t“^ (j) -nlq\< \/n for all j e [q] and let Z^^ ^al 
be the number of balanced < 7 -colorings of S. 
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(3.3) 


Lemma 3.2 ([14]). For any q,k>3 and any c> 0 we have 


and 

lim — lnE[Zo]= lim — lnE[Zobal] = lnfl +clnfl - . 

n—»oo J2 ^ n—*oo fi ^ ‘ 


(3.4) 


Proof. Calculations similar to the following ones were performed in [14]; we repeat them here to keep the paper 
self-contained. Given a balanced map a ■. [n] ^ [q], let = \a~^{i)\ln for i e [q] and define a - [ai,...,aq). 
Stirling’s formula yields 


P [cr is a proper < 7 -coloring of 


(n\ 


■^q (aifiW 


-1 

/ 


1 


^i=\ 1 k ) 

Ui 

= 0 

exp 

cn In 

1-E4 


cn 

cn^ 




1 *e|'?l 



(3.5) 


cf. [14, equation ( 8 )]. Let d- {\l q,...,\l q) denote the uniform distribution on [q\. The gradient of the function 

/: (xi.Xfc) ln{l - Z/eiqi at the point d is simply the vector Wf{d) with every entry equal to fc(l - q^~^]~^. 

Consequently, because Y.is.[q] (“/ - l/<7) =0, expanding / to the second order around d yields 


In 


1-E 

!£[?]/ 


- f[a) - f{a) + Vf{a)[a-a) - 1 - 0(||q:- ajlj) = ln(l - q^ + 0(||a- ajj^). 


(3.6) 


Since a is balanced, we have the bound ||a:- djl^ = 0(l/n). Therefore, combining (3.5) and (3.6), we obtain 

P [a is a proper ^/-coloring of(^] = 0|(1- (3.7) 

uniformly for all balanced a. Finally, the number of balanced maps corresponding to a given “ is „ „) = 

Stirling’s formula, and the number of choices for the vector a is 0 (n^ 4 -i)/ 2 j Hence the total 
number of balanced maps a is Q{q"). Combining this with (3.7) implies (3.3). 

Next, as observed in the proof of [14, Lemma 2.1], the probability that a map a ‘.[n] ^ [q] is a q-colouring of ^ 
is maximised when a is perfectly balanced, and this probability equals 0(1) (l - (Here the 0(1) factor is 

needed only when q does not divide n.) Hence, by linearity of expectation, 

E[Zg,bai] < E[Z^[ = 0(1) [qa-q^-'^ry, 

which differs from (3.3) by at most a constant factor. This implies (3.4), completing the proof. □ 


It is easily verified that the r.h.s. of (3.4) is positive if c < (q^~^ - j)lnq- ln2. Hence, for such c, both ElZ^,] 
and E[Z(^_bal] are exponential in n. They differ only in their sub-exponential terms. Consequently, we do not give 
anything away by confining ourselves to balanced colorings only. In the following we will see why neither Zq nor 
Zq,bal is a good random variable to work with and why neither can be used to prove Theorem 1.1. What we learn 
will guide us towards constructing a better random variable. 

While working out the first moment of Z^ bai (i e., the proof of Lemma 3.2) is pretty straightforward, getting 
a handle on the second moment is not quite so easy. 0 f course, the second moment of Z^^ bal is nothing but the 
expected number of pairs of balanced q-colorings. Moreover, the probability that two maps cr,T : [n] \q\ si¬ 

multaneously happen to be q-colorings of will depend on how “similar” a, t are. To gauge similarity, define the 
overlap of a, t as the q x q-matrrx a{a, t) = {atj [a, t)) i.jEiq] with entries 

aij{a,T) - n~^\a~^{i)nT~^U)l 

In words, aij [a, t) is the probability that a random vertex vein] has color i under a and color j under t. Then we 
can cast the second moment in terms of the overlap as follows. Let 5? = l^n.q be the set of all overlaps a{a, t) of 
balanced cr, t : [n] [q\. Though the results of the next lemma can be found in [14], for completeness we provide 

a brief proof here. 
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Lemma 3.3 ([14]). Let\\a\\k- j be the £ ic-norm and define 

H{a) = - aijlnaij, E{a) = Eq,c,k{a) = c\n\\-2q^~’^+ \\a\\^A. 

i,i^W 

LetE{a) - H{a) + E{a) and suppose that a* e 5? satisfies Fla*] = maxaesi E{a). Then 

E\Z'^^^^^]^e^p[nF[a*] +o[n)]. (3.8) 

Next, let ^ be a positive constant and suppose that .saf c has the following property: atj > £, for all a e and all 
i,j E [q]. Then 

E[^q,bai ■ •'^1 = E exp [riFla]]. (3.9) 

a^s4 


Proof. First, observe that for a given aeS&, the number of a, x with overlap a is given by the multinomial coefficient 

' 

fliiu, a\2n,...,aqqn^ 

Next, fix balanced maps cr,T with overlap a. By inclusion-exclusion, the probability that a random edge chosen 
uniformly out of all possible edges is monochromatic under either a on equals 


ykj 


E 

i^[q\ 

= E 


k 


Jljelq] tlji n 

k 


■ E 

i.jelq] 


^ aiju'^ 

k 


! 

k 

1 

k- 

E 

+ 

E 


.U'eW 


xj^lq] 



- E 4 + 0 7- 


ij^lq] 


(3.10) 


To simplify this expression we observe that since a, x are balanced. 


E E ‘^ij - E 

i^lq] \]^lq] !£[?] 




Hj 

I 


l-fc 


■ q'^-'^-kq 


1- E 


a 


‘J 

ij€lq] 


+ 0 


1 


E -- E 

^ J^lq] 


= ^^“^-tO(l/u), 


because aij - 1 and \q ^ - Iljeiq] thjl - 0(n ^^^). Of course, the same steps apply to Z/eiqi 

Hence, since cr and x are balanced, (3.10) can be written as 


2q^~- E 4 + 0 - ■ 


i,j=l 


(3.11) 


Therefore 


E[Z; 


q,ba[^ 


■E 


CliiTl, U\2fl, ■ ■ n 


exp[nE[a] + 0(1)). 


(3.12) 


Let h V 1 denote max{h, 1}. We give upper and lower bounds on the multinomial coefficient by applying Stirling’s 
formula in the form 

1 


h!= \/2:x(hvl) |^| 


1-tO 


h-t 1 
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which holds for all nonnegative integers b. This gives 


yif ^12 n,... , dqq Tl ^ 


[2nn)^^ exp [nH(fl)] J][ (a;yVl/n) 
!,je[g| 


(3.13) 


Since 1/n < atj y IIn<2lq for all i,j e [q], and since each row and column sum equals \!q+o[V), the product over 
i, j e [q] in (3.13) is always bounded below by a constant and (easily) bounded above by Therefore 


Q(^(i q )/ 2 j [nH{a)] < 


n 


anti, a\2n,aqqti^ 


0(1) exp lnH{a)]. 


Combining the above leads to 

q(„( 1 -</^)/ 2 ) ^ exp[nF(a)] <E[Z^ jj^] < 0(1) ^ exp[nF{a)]. 

Taking just the term corresponding to a - a* in the lower bound gives the lower bound of (3.8), and the upper 

2 

bound follows using the fact that \S^\ < n‘^ . 

Next, observe that ii ae si then Yli,]€iq] {(lij v = 0(1). Substituting this into (3.13) and restricting the 

sum in (3.12) to si completes the proof of (3.9). □ 

2 

Let ® c be the polytope comprising of all a = {aij)ij-^q such that 

H - ^ aji-\lq foralliE[q], aij>0 for all /,7 e [< 7 ]. 

Then @ is the Birkhoff polytope, scaled by a constant factor, and n ® is dense in @ as n ^ 00 . Therefore, (3.8) 
yields 

lim — lnE[Z^. „,] = maxF(a). 

n —00 n ‘1’°^ ae@ 

Further, evaluating the function F{a) from Lemma 3.3 at the “flat” overlap a - {aij) with aij - q~^ for all i, j e [q], 
we find 

Fid) = 2[ln^7 + cln(l-^ 7 ^■*')]. (3.14) 

This term is precisely twice the exponential order of the first moment from (3.4). Consequently, the second mo¬ 
ment bound E[Z^j^^] = 0 (E[Zg_bai]^) can hold only if 

F{d] - meoiFia). (3.15) 

aeSi 

In fact, the Laplace method applied along the lines of [18, Theorem 2.3] shows that the condition (3.15) is both nec¬ 
essary and sufficient for the success of the second moment method. In summary, the second moment argument 
reduces to the analytic problem of maximising the function F over the polytope S>. 


A relaxation. This maximisation problem is anything but straightforward. Following [4], Dyer, Frieze and Green- 
hill [14] consider a relaxation. Namely, instead of optimising F over S), they consider the (substantially) bigger 
domain of all a - such that ciij - 1 /^ for all i e \q\ and aij > 0 for all i,j e [q], dropping the 

constraint that the “column sums” X; i^ji equal 1/q. Note that Si’ is the set of singly (row) stochastic matrices, 
scaled by a constant factor. Clearly, max^jegEia) < maxaes’PM- Furthermore, Dyer, Frieze and Greenhill solve 
the latter maximisation problem precisely by generalising the techniques from [4[, requiring rather lengthy tech¬ 
nical arguments. The result is that for c up to the lower bound in (1.1) we indeed have maxaesf F{a) - F{d). 

But this method does not work up to the density promised by Theorem 1.1. There are two obstacles. First, not 
far beyond the lower bound in (1.1) the maximum of F over SF is attained at a point a' eSF\Q), i.e., F{a'] > F{a). 
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Thus, relaxing ® to the larger domain S/’ gives too much away. Second, there exists a constant 7 > ln2 such that for 
c = - 1/2) In - 7 , the value of F attained at 

^stable = - q~’') id + q~’'(q - [q^a - id) e Si 

is strictly greater than Fid). (Note that every entry of the matrix equals 1.) Consequently, even if we could solve 

the analytic problem of maximising F over the actual domain S it would be insufficient to prove Theorem 1.1. 


Tame colorings. The above discussion shows that it is impossible to prove Theorem 1.1 via the second moment 
method applied to A similar problem occurs in the case of random graphs [k = 2), see [9]. To remedy this 

problem in the hypergraph case we will generalise the strategy from [9]. 

The key idea is to introduce a random variable Z^ tame that takes the typical geometry of the set BFS] of all bal¬ 
anced (^-colorings of^ e '^{n, k, cn) into account, such that 0 < Z^tame S Z^^ai- According to predictions based on 
non-rigorous physics considerations [23], the set has a geometry that is very different from that of a random 
subset of the cube [q]^"^ of the same size. More precisely, for almost all fc-uniform hypergraphs G with cn edges, 
the set B{G) decomposes into well-separated “clusters” which each contains an exponential number of colorings. 
However, the fraction of colorings that any single cluster contains is only an exponentially small fraction of the 
total number of (^-colorings of G. Furthermore, while it is possible to walk inside the set B{G) from any coloring 
to any other colouring in the same cluster by only changing the colors of 0 (ln n) vertices at a time, it is impossible 
to get from one cluster to another without changing the colors of Q(u) vertices in a single step. Now, the basic 
idea is to let Z^tame = Z^^^al • where ^ is the event that the geometry of the set B{G) has the aforementioned 
properties. 

To make this rigorous, we define the cluster of a q-coloring cr of a hypergraph G as the set 
'^(G,(t) = ir e B{G) : min aula,!) > q~^{l.0llk)^^^’‘~^^ 

{ ie[q] 

In words, '^(G,(t) contains all balanced ^/-colorings t of G where, for each color i, at least a ( 1 . 01 /fc)^b*;-i) fraction 
of all vertices colored i under a retain color i under t. Call a t^-coloring cr of G separable if 

\/t e B[G), \/i, j e [q], aij{a,T) ^ [q~^[1.01lk)^^^^~^\q~^[l-K)) wheiex - q. (3.16) 

Definition 3.4. A q-coloringa of the (fixed) hypergraph G wtame if 

Tl: a is balanced, T2: a is separable, 73; |'^(G,cr)| < ElZ^^ ^al]- 

Definition 3.4 generalises the concept of “tame graph colorings” from [9, Definition 2.3]. 

The set of tame colorings of a given hypergraph G decomposes into well-separated clusters. Indeed, the sep¬ 
arability condition ensures that the clusters of two tame colorings ct,t of G are either disjoint or identical. Fur¬ 
thermore, T3 ensures that no cluster size exceeds the expected number of balanced colorings, i.e., the clusters are 
“small”. This will allow us to control the contribution to the second moment from colourings which lie in the same 
cluster (see Lemma 5.4). Furthermore, if a,T are tame colorings then the overlap a[cr,T) cannot equal the matrix 
^stable defined above, as this matrix fails T2. So, restricting attention to tame colourings excludes the matrix flstable- 
Let Zj^tame be the number of tame t^-colorings of^(n, k, cn). With the right random variable in place, our task 
boils down to calculating the first and the second moment. In Section 4 we will prove that the first moment of 
Zq.tame is asymptotically equal to the first moment of Z^ ^ai- For the following two propositions we assume that 

(^ 7 *'“^-l/ 2 )ln^- 2 < c< (^*'“^-l/ 2 )lni 7 -ln 2 - 1 . 011 nt 7 /^. 

That is, we consider values of c which lie between the standard second-moment lower bound (on the ^/-colorability 
threshold Cq^k) and the one we prove here. 

Proposition 3.5. There is a number qo > 0 such that for all q> qo we haveE[Zq^tame] ~ E[Z^_bal]- 
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Further, in Section 5 we establish the following bound on the second moment. 

Proposition 3.6. There is a number qo> 0 such that for all q> qo */E[Zq_tame] ~ E[Zq_bai] then 

E[2g,tame] = 0(E[Zg,bal]^) = 0(E[Z^,tame]"). 

Thus, while moving to tame colorings has no discernible effect on the first moment. Proposition 3.6 shows that 
the impact on the second moment is dramatic. Indeed, the matrix flstable shows that E[Z^ > exp(n(?i))E[Zg_bal]^ 

for c near the bound in Theorem 1.1, while = 0{E[Zqh^\]^) for all cup to (t7^“^-l/2)ln(7-ln2-1.011n qlq. 

Then Theorem 1.1 follows from applying Fact 3.1 to Z^^ tame. by Propositions 3.5 and 3.6. 

Finally, the obvious question is whether the approach taken in this work can be pushed further to actually 
obtain tight upper and lower bounds on the ( 7 -colorability threshold. However, it follows from the proof of Propo¬ 
sitions 3.5 that the answer is “no”. More specifically, in Section 4.4 we prove the following. 

Corollary 3.7. For any fc > 3 there exists a sequence [£q)q >3 such that limq^oo £<? = 0 and such that the following is 
true: For any c such that 

[q^~^ - \l2)\nq-\n2 + £q< c< {q^~^ - l/2)lnq 

there exists 5 > 0 such that 

lim P[Zq<exp(-dn)E[Zg]] = 1. (3.17) 

Now, assume for contradiction that there is a random variable 0 < Z < Z^ with the following properties. First, 
Z(G) > 0 only if G is q-colorable. Second, lnE[Z] ~ lnE[Zg] (cf. Lemma 3.2 and Proposition 3.5). Third, E[Z^] = 
0(E[Z]^). Then the Paley-Zygmund inequality implies that 

lim lim P [Zq > exp(-5n) E[Z(^]] > lim lim P [Z > exp(-5n)E[Z]] > 0, 

§—>.Qn^oo 

in contradiction to (3.17). Corollary 3.7 is in line with the physics prediction that the actual q-colorability threshold 
is preceded by another phase transition called condensation [23], beyond which w.h.p. Zq < exp(-0(n))E[Z(^]. In 
particular, the lower bound of Theorem 1.1 matches this “condensation threshold” up to an error term that tends 
to 0 in the limit of large q. 


4 The first moment 

Throughout this section, unless specified otherwise we take cr, t : [n] ^ [q] as balanced maps, and assume that 

(q*'“^-l/2)ln^-2< c< l/2)lnq-ln2-l.Ollnq/^. (4.1) 

We frequently make use of the Chernoff bound. 

Lemma 4.1. ([20, Theorem 2.1]) Let(p{x) = (1 -t- x)ln(l + x) - x. LetX be a binomial random variable with mean 
/i > 0. Then for any t> 0 

P[X > fi+ t] < exp{-p(/)(r/p)}, P[X < fi- t]< exp{-p(/)(-r/p)}. 

In particular, for any t > 1 wehaveP[X> tp] < exp{-tp\n{tl e)}. 

4.1 The planted model 

The aim in this section is to establish Proposition 3.5, the lower bound on the expected number of tame col¬ 
orings. Let a : [n] ^ [q] he a (fixed) balanced map that assigns each vertex a color. It suffices to prove that 


P [cr is a tame coloring of '5^|cr is a coloring of= 1 - o(l). Furthermore, the conditional distribution of given 
that O' is a coloring admits an easy explicit description: the conditional random hypergraph simply consists of m 
random edges chosen uniformly out of all edges that are not monochromatic under a. 

It will however be convenient to work with a slightly different distribution. Let k, cn,a) be the hyper¬ 

graph on [n] obtained by including every edge that is not monochromatic under cr with probability p, indepen¬ 
dently, where 


P = 


cn 


cfc!-(l-i- 0 (l/n)) 






(4.2) 


Observe that the expected number of edges equals cn. We call '^{n,k, cn, cr) the planted coloring model. 
Lemma 4.2. Leta: [n] [^ 7 ] be a fixed balanced map. For any events we have 

Pi'S eS\ a is a coloring of‘^] < 0{\/n) P £ S\ . 


Proof. By Stirling’s formula, the probability that 'Sa has precisely m edges is 0(n If this event occurs then the 
conditional distributions of '^a and of ^ coincide. □ 


Hence, we are left to show that the probability that cr fails to be tame in ’^cs is o{n~ 2 ). Indeed, in Sections 4.2 
and 4.3 we will establish the following two statements. In both cases the proofs are by careful generalisation of the 
arguments from [9] to the hypergraph case. 

Lemma 4.3. With probability \ - exp{-Q.{n)) the planted coloring a is separable in[n, k, cn,a). 

Lemma 4.4. With probability \ - exp{-D.[n)) we have(,'^a, (t)\ <E[Z^_bal]- 


Proposition 3.5 is immediate from Lemmas 4.2-4.4. 

Much of the analysis in this section will involve random variables defined using the following edge counts. For 
sets Xi,X2,X3 cz [n] and a elk], we let maiXi,Xz,X3] be the number of edges e of^Sa such that there exists xe Xi 
and distinct vi,...Va e X2 with x,vi,...Va e e and e\{x,vi,...Va] ^ X3. If cr = fc- 1 then we write m^-i (Xi, X2) 
instead of mk-i{Xi,X2,X3), since X3 has no effect in this case. For ease of notation, if Xi - {v} we simply write 
ma{v,X 2 ,X 3 ), or mk-i{v,X 2 ). We set Vi - a~^{i) to ease the notational burden. 


4.2 Separability: proof of Lemma 4.3 

Let T : [n] ^ [c/] be a balanced map which is not separable: that is, for which there exist i,] e [^7] such that (3.16) is 
violated. Of course, we may assume without loss that i- j -1. We aim to show that t is unlikely to be a coloring of 
^cr- Clearly, if t is a coloring of '^a then (1) is an independent set of size about nlq that has a rather substantial 
intersection with the independent set cr“^(l). Here, as for graphs, an independent set is a set of vertices which 
contains no edge. The following lemma rules this constellation out for a wide range of intersection sizes. 

Lemma 4.5. With probability 1-exp{-D.in)) the hypergraphhas no independent set I of order (l+o]!))^ such 
that 


cr"^(l)| e (c7"^(L01/fc)^''*'“i\ c7“Hi - . 

Proof. Suppose that I is an independent set with |7| = -^(l-i- o(l)) such that S = /nCT“'^(l) contains |S| = ^ vertices, 

q q 

for some 5 e (0,1). Then the set 

VofS) := [ve Vycr'Hl): mk-i[v,S) = 0} 
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contains I\S. Observe that 


P[mk-i{v,S) = 0 ] = expt -p 


|S| 

yk-lj 


k-l 


(1 + 0(1 / n)) = exp ^ 


(l + 0 (l/n)) < 2^7 


-ks^ 


it suffices to prove that when ( 1 . 01 /fc) 


i/(fc-i) 


S c O’ Hi) of size sniq with no{S) > (1 - s + o(l))- 


< s<l-q 


ineq 

(i.oi-i:)/ 2 ^ with probability 1 - exp(- 0 (n)) there is no subset 


Let rioiS )| Vb(S)|, and observe that in order for I to exist, the inequality no(S) > (1 - 5 + 0 ( 1 ))-^ must hold. Thus 


Since no(S) is stochastically dominated by Bin(|y\(T ^{l)\,2q *), we have by the Chernoff bound (see Lemma 

4.1) that 

1-5 


P «o(S)> (l- 5 +o(l))- <expt-(l-s+o(l))-ln 


Zqt-ks'^-^ej 


The number of choices for a subset S of cr ^ (1) of size snIq equals 

e \il-s+o(l))nlq 

< I- 

snl q 


(tH)' 


: exp 


{(l-s+o(l))|(l-ln(l- 5 )}, 


as established in [9, equation (A.5)]. Hence, by the union bound over S, the probability that such a subset S exists 
with the desired lower bound on no(S) is at most 


exp-| -(l- 5 + 0 ( 1 )) — 


In 


1-5 




-l + ln(l-5) 


= exp (l - 5 + 0 ( 1 )) — In 


2e^ 


\qks’‘-^-t{l-s)2j 


This probability tends to zero if and only if 


V2e 

q(t-ks>^-hl2 


<1-5. 


(4.3) 


By convexity, the exponential function on the l.h.s. intersects the linear function on the r.h.s. at most twice, and 
between these two points of intersection the linear function is largest. For sufficiently large q, explicit calculation 
shows that the values 5 = (1.01/fc)'^^^*^”^' and s = 1 - qh-Oi-k)!! g^^sfy (4.3)^ completing the proof. □ 


Lemma 4.5 does not quite cover the entire interval of intersections required by (3.16). To rule out the remaining 
suhinterval (^“^(l- ^ 7 *'^“^'^^), ^/“Hl -x)) we use an expansion argument. The starting point is the observation that 
most vertices that have color 1 under t hut not under a are likely to occur in a good number of edges in which all 
the fc - 1 other vertices are colored 1 under a. We have not attempted to optimise the constants in this lemma. 

Lemma 4.6. LetT: [n] [q] be a balanced map such that ania ,t] e (^“'^(l - ^“'^(l - k)). Withproba- 

bilityl-exp{-Q.{n)), the random hypergraph ^‘S{n,k,cn,a] has the following properties: 


1. The setY{v eV\a : mk-i{v,a ^{1)) <15} has size at most nx I {3 q). 

2. ThesetU t“H1)\(o'~H1) u Y) satisfies \ T“'^(l),cr“'^(l)) < 5 |(t“H1) \ t“'^(1)|. 

Proof. By assumption, |(t“H 1) n t“H1)I = ^ where s e (1 - qh.tit-k]i 2 ^ l - jc). Fix veVWi. We know that 


mk-i[v,Vi) ~ Bin 


ll^il 

k-l 


P 


Therefore 


14 HIHIH 

P[mfc_i(t;,yi)<15]< ^ 

7=0 J , 


pHl-pYk-ii ■'<(i-p)U-il 




j=o 
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Combining (4.2) with the lower bound from (4.1) shows that p > fcln q, which in turn implies that 

P [mic-\{v, Vi) < 15] < 3(A;ln^7)'^^ q~^. 

As the event {mjc-i{v,Vi) < 15} occurs independently for all v e VWi, the total number Y of such vertices is 
stochastically dominated by Bin(n(l - 1 /( 7 ), 3 (A;ln^ 7 )'^^ t 7 “^). Therefore E[F] < q~^. Finally, by the 

Chernoff bound (see Lemma 4.1) and using the definition of k from (3.16), 


P[F > nKl(3q]] < expinxI{3q)} - exp{-n(n)} 


and so the proof of (/) is complete. 

For notational convenience, we write R = (T“'^(l)\T“ni) and T = T“'^\(T“ni)- Observe that mi{U,R,Vi) is 
stochastically dominated by mi{T,R,Vi), since (7 is a subset of T. Furthermore, mi{T,R,Vi) has the binomial 
distribution Bin|| T\\R\ ([^ 2 )’ p] ■ Therefore 


E[mi(r,i?,i/i)]<iri|if| 


k-2 


n(l-s) 9 

p < (1 + o(l)) |if| -- -n ^k^qlnq 


= (l + o(l))|if|-(l-5)A:'^ln(7. 


Finally, asl-jc<l-s< part (ii) follows from the Chernoff Bound. 


□ 


Proof of Lemma 4.3. Suppose that t is a balanced map such that ( 2 ii((t,t) > (7 '^(1.01/fc)^^*^ By Lemma 4.5, we 
may assume that aii{a,T] > q~^{l- With U, Y as in Lemma 4.6 we have that 

15|[/| < miiU,R,Vi] < 5|cr"i(l) \T“i(l)|, 


so|L1|<||(7 ^(1)\t ^(1)|. Also, since t is balanced, we have 77 ~ |t ^(1)| = 1(t '^(l)nT ni)l + |L/| + |F|. Substituting 

3 q 

our bound on U from above, and using Lemma 4.6, implies that nan{a-,T) - 1(t“^(1) n t“^(1)| > n{l-K]lq, as 
required. □ 


4.3 The cluster size: proof of Lemma 4.4 

To upper bound the cluster size we wiU exhibit a large “core” of vertices of ‘Sa that are difficult to recolor. More 
specifically, the core will consist of vertices v such that for every color ifa{v) there are several edges e containing 
V such that e\{v}cz Vi and such that all vertices of e belong to the core. Therefore, if we attempt to change the color 
of L’ to i f a {v), then it wiU be necessary to recolor several other vertices of the core. In other words, recoloring a 
single vertex in the core leads to an avalanche that will stop only once at least n£ 7 “^(L 01 /fc)'^^^*^“^^ vertices in some 
color class have been recolored. Hence, the outcome is a coloring that does not belong to 

Formally, given a fixed balanced map a and fixed hypergraph G, the core Vcore of G is defined as the largest 
subset V' c [n] of vertices such that 

mk-i{v,VinV')>l00k for all veV' and all ifa[v). 

The core is well-defined; for if V', V" are sets with the property, then so is V u V". 

Lemma 4.7. With probability 1 - o{rr^'^) the random hypergraph^a has the following two properties: 

(i) The core of^a contains at least {\- q^~^ ^ vertices. 

(ii) Ifr is a balanced coloring oftSa such thatr{v] f a{v) for some v in the core, then t 

We proceed to prove Lemma 4.7. To estimate the size of the core we consider the following process: 
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CRl For i,j e [q] and i ^ j, let Wij - {v eVt: rrik-iiv, Vj) < 300fc}, Wi - W = UiWi. 

CR2 For i ^ j let Utj = {veVi: mi{v,Wj,Vj) > lOOA;}, andU = Ui^jUij. 

CR3 Set = U and repeat the following for / e N, 

• if there is a y e Vj\Z^^^ such that mi{v, > lOOA; for some j ^ a{v] then take one such v and let 

• otherwise, set = Z^^\ 

Let Z = U/>oZ^^^ he the final set resulting from CR3. 

Claim 4.8. The setV\{W \J Z) is contained within the core. 

Proof. For a contradiction, let v e 1^ \ [W u Z). Since Wj c Vj, any edge counted hy mjc-iiv, Vj) that does not 
contribute to mi[v, Wj,Vj] must have empty intersection with Wj. Since Vj) 5 300fc hut m\{v, Wj, Vj) < 

look, we must have that nik-iiv, VjXWj) > 200A:. Similarly, since v ^ Z we have 

mi(i',Zn Vj,Vj) < mi[v,Z,Vj] < lOOfc, 

and therefore nik-iiv, Vj\iWji}Z) > 100 A:. Furthermore, this statement holds for all j f a[v) and all v e V^\(FFuZ). 
It follows that the entire set 1/ \ (VFu Z) may he added to the core, which contradicts maximality unless 1^\ {W u Z) c 
Vcore: as required. □ 

We now hound the size of W, U and Z. 

Claim 4.9. The function Q{q,k) = q issuchthat\Wij\ < n-Q(q,k) withprobabilityatleastl-exp{-D,{n}}. 

Proof. Fix V e Vi. Due to the independence of edges in '^{n, k, cn,a) we know that m]c-\{v, Vj) is distributed bi- 
nomially with mean (|.'^|)p(l + o(l)) > klnq+ Oq[q~^). It follows from Lemma 4.1 that P(t; e Wij) < j-Q[q,k) 
for V E Vi and sufficiently large q. Therefore E[|lV;j|] = ^ • Q{q,k). Finally, since \Wij\ is distributed binomially, 
a straightforward application of the Chernoff bound shows that P[| Wij \> n - Q{q,k)] < exp{- n ■ Qiq, k) ln(3/ e)} = 
exp{-n(n)}. □ 

Claim 4.10. We have\U\ < nlq^^^ with probability at least l-exp{-D.{n)]. 

Proof. Fix V eVi. The quantity mi{v, Wj, Vj) is stochastically dominated by Bin|| p) ■ Hence, with Qiq, k) 

as previously, we know that 


E\mi{u,Wj,Vj) \Wj\<n-qQ{q,k)\ < nkp 


k-2 


■qQiq,k)^Oqiq^ ^) 


Applying the Chernoff bound gives P [j; e Uij \ \ Wj\< n - qQiq, A:)] < Oqiq The independence of edges in 
'^in,k,cn,a) then implies that Uij is distributed binomially with mean less than n ■ Oq iq~^°^) . The Chernoff bound 
implies that 


P^\Uij\>nq ^^*^1 \Wj\<n-qQiq,k)^<exp{-klin)}. 

The result follows by Claim 4.9. □ 

Claim 4.11. We have\Z\ < niq^^ with probability at least l-exp{-D.in)]. 
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Proof. Claim 4.10 tells us that \ U\< nl with prohahility 1 - exp{-Q(u)}. We will condition on this event. Sup¬ 
pose that \Z\U\ > i* - nlq^^^ and consider the set Z*' ^ obtained after i* steps of CR3. The construction of Z 
implies that there exists 100fc|Z** ^ \ U\ vertex-edge pairs [u, e) such that en Z > 2 and e\{v}Q Vj for some j e [q]. 
Since each edge may appear in at most k vertex-edge pairs, this implies that there are at least 100 |Z*‘ ^ \ U\ such 
edges. Therefore, there are at least lOOi* = 100u/^7^'’*^ edges e such that en Z*‘ ^ > 2 and e\{v} c Vj for some 
j elq],v£ e, despite the set Z** ^ only being of size at most 2nlq^^^. We prove that with high probability, no such 
set can exist. 

Let a - q~^°^ and let T c [n] be a set of | r| = an vertices. Let nij be the number of edges e such that e n T > 2 
and ellf} c Vj for some j e [q], v eV. We know that is stochastically dominated by Bin^2a[2){^^_‘^2)’P]’ 
we may observe by the Chernoff bound that 

P[mj > 100- an] < expllOOaulna}. 


If we let N be the number of sets T of size | T\ = an such that mt > 100- an, then 


P[Ar>o] < 




yanj 


expllOOaulna} < exp|-|alna -i- (1 - a)ln(l - a) - lOOaln aju| = exp{-0(u)}. 


The final bound holds since a is constant with respect to n and a e (0,1). Therefore with probability l-exp{-0(u)} 


we have \Z\U\ < nlq^^^, which implies the claim. 


□ 


Lemma 4.7 (i) then follows from Claims 4.8-4.11. 

To establish (ii) we say that a vertex v is j-blocked if there is an edge e3 v such that e \ {v} is contained in the 
core and e\{L’} c Vj. We say that a vertex v is a - complete if \t is y-blocked for all j f a{u). Note that, as with vertices 
inside the core, recoloring any cr-complete vertex will set off a coloring avalanche. Therefore, the cluster size is a 
result of vertices that fail to have a neighbour in the core (i.e. the cluster size is a function of the number of 1 -free 
vertices). 

Claim 4.12. With probability I-exp{-D,(.n)] the random graph‘Sa has the following property: 

ifr E ifSa .O’) then for all a - complete vertices v wehavea{v) - r{v). 


Proof Note that it suffices to prove that a[v) - t[v) for all v in the core, since this implies the result for all cr- 
complete vertices outside the core as well, by definition of cr-complete. 

Recalling Lemma 4.3, we may assume that a is separable in '^{n, k, cn,a). For i e [q], let 
At = {yE Vcore'-t(v) = * f O-(y)}, A7 = {v £Vcore'-t{v) f / = cr(l')}. 


Then 


i=l 


.:a{v)fT{v]}\ = |. 

! = 1 


(4.4) 


The assumption that cr is separable implies max/e|q] |At| < ^k{1 -i- o( 1)) and max/£|q] |A^. | < -i- o(l)). If we 

can show that {v e Vcore -ctivif t{.v]] = 0 then a{v) = t(v} for all cr-complete vertices. 

Take v e At. Since v e Vcore we know that mt-iiv, Vi]> 100. Further, since x is a coloring, we must have that 
mi{v,AT, Vi)> 100. Let \A'T\- fn and observe that 


' 1.01 UK 

P[mi{At,AT,Vi)>fn]< ^ 


fn 


pn 


<2 


2.8 nK\Vi 


\k-2„\ 


q[k-2)\ 




< (Sfc^Tclnc/)^” = exp{-Q(u)}, 


by definition of k. This implies that for all i e [q], 


W0\A^\<mi[AlAT,Vi)<\AT\. 


(4.5) 
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Therefore, (4.4) implies that At = A^. =0 for all i, as required. 


□ 


Lemma 4.7 {ii) is immediate from Claim 4.12. 

The core size guaranteed hy Lemma 4.7 is not quite big enough to deduce a good bound on the cluster size 
(due to the polylogarithmic factor). To remedy this problem, we say that a vertex v is j-blocked if it is contained in 
an edge e such that e \ {t/} is contained in the core and e\{u] cVj. Further, we say that v is a-free if there are at least 
q:+ 1 colors j (including aiv)) such that v fails to be 7 -blocked. A careful study of how the vertices outside the core 
connect to those inside yields the following. 

Lemma 4.13. With probability 1 - o(n~^^^) there exists a set Aw of vertices such that there are at most + 

Oq{q~^)) vertices outside of Aw which are 1-free and there are at most nq~^{\ + Oq{q~^)] vertices which belong to 
Aw or are 2-free. 

We proceed to prove Lemma 4.13. Let 


Ai^{vEV\Vi-mk-i[v,Vi)^Q}, 


and define 

^0 = u ^00 = U n ^j) - Az-{vEV:mi[v,Zn Vi, Vi) > 0 for some i a{v)}, 

/E[fci iyj 

Aw - {veV: mk-iiv, Vi \ Wi) - 0 for some i ^ crli;)} \ Aq . 


By construction, if v is 1-free then y e Aq U Az U Aw, and if v is 2-free then v e Aqo U Az U Aw- Thus, if we can 
bound the size of these sets that we will obtain the estimates required by Lemma 4.13. 

Claim 4.14. We have\Ao\ < nl q^~^ and\AaQ\ < nlq^^~^ with probability at least 1-exp {-D-in}}. 

Proof Take v e Vj, and i j. Now P [mk-ilv, Vi) = 0] < expl-fclnq}, and hence P [y e Aq] < (q- l)q~^- It follows 
that E[|Ao|] p< n-{q- l)q~^- Since P [mfc_i(i;, Vi) = 0] > exp{-(fc-i- l)lnq} we must have that p > nq~^ and so by 
the Chernoff bound 


P I |Ao| > nlq^ ^ < exp 


-nq 


-k 


-In 


q-\ \q-ll q-1 


exp{-n(«)} 


as desired. Further, the argument for Aqo follows quickly after noting that the edge sets of mjc-i [v, Vi) and mjc-iiv, Vj) 
are independent for i f j. □ 

Claim 4.15. We have\Az\ < nl q^^ with probability at least l-exp{-D.{n)}. 


Proof. Take ve VWi. Observe that mi (f.Zn V), 1/;) is bounded above by milt;, Z, V)), which is stochastically dom- 
inated by Bin 11Z | (['^ 2 ), pj. Therefore 


P|mi(t;,Zn V;,V;)>0 I \Z\<nlq^’^\ <l-(l-p)4 


_n_,\ViU 
„9k Vk-Zl 


ck^ 


ifc-i 


a-q^-’^) 


■~\Vi\^-^^q-^K 


It follows that conditional on the event |Z| < nlq^^ we know that jAzI is distributed binomially with mean less 
than nl q^^. Finally, application of Lemma 4.1 implies that 


P [|Azl > nlq^^ I |Z| < < exp{-Q(n)}, 


so the result follows from Claim 4.11. 


□ 
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Claim4.16. Wehave\Aw\'S: n! ^ with probability at least l-exp{-D.{n)}. 


Proof. Fix if j. We begin by developing probabilistic bounds for mjc-i (.Vi, Vj) and m\(Vi, Wj,Vj). As mic-i(Vi, Vj) 


in(|V'd(|.';^'|),p),wehaveE[mfc_i(l/i,V/)] 


Bin 


— - » P 


(1 + o(l)), and so by Lemma 4.1 


mk-i{Vi,Vj] > — 


i- 

n p 


fc - 1/2 




> 1 - exp < -p 


1 


1 - — In 1 -— + — 

2fc) I 2fc) 2fc 


1 


Since m\(Vi, Wj, Vj] is stocbastically dominated hy Bm^\Vi\\Wj\[^^f]^, p^, we 


bave 


> 1 - exp{-Q(n)}. 


(l + ollDu'^ ^p\Wj\ , . 

E [m,(Vi, Wj, Vj)] < -- L . 

Let ^ denote tbe event that | Wij \< n- Q(q, k) for all i f j, where Q(q, k) = Oq{q~^~^). By Claim 4.9, this event fails 
with exponentially small probability. Therefore 

r , [I + o(l)) p (q -1) , 1 . p r. 

E[mi(Vi, Wj, Vj)] < ^^ ■ q^-'^ Q(q, k) + 7 ^—^ q-’^ exp{-Q(n)} 


{k-2)\ 

<- q^ ^ Q(q,k). 

(k-2)l ^ ^ ^ 


{k-2)\ 


In the first line we apply the law of total probability, conditioning on the event that ^ holds in the first term, and 
on the event that ^ fails in the second term, using | Wy | < | Vj \ in that case. Applying Lemma 4.1 gives 

P Vj) < Bn^p- q^~^ Q(q, fc)j = 1 - expl-Llfu)}. 


For ease of notation, let 

and define 


ro = 


T = mk-i(Vi, Vj), Y = mi(Vi, Wj,Vj) 
n^pik- 1 / 2 ) 


q^k\ 


YQ^2n^p-q^~^Q(q,k). 


We will condition on the event 8 that T > Tq and 7 < Yb- 

We now appeal to a balls and bins argument. Think of the edges that contribute to Y as yellow halls, those that 
contribute to T - 7 as blue balls, and tbe vertices of Vi as the bins. Let be tbe set of vertices (bins) in Vi that 
receive at least one hall, hut do not receive a blue ball. Fix a positive integer m < 10, and suppose that t> Tq. If r 
balls are thrown into | Vi \ bins, the probability that an individual bin receives precisely m balls is given by 


ft^ 

\mj 


17,1“'” (l-\Vir^)‘~'" < 2(tqln)"'exp{-tqln} < 2q^'‘^~'^ k"' In'” q. 


Given that vertex v e Vi receives a total of m balls, the probability that all its halls are yellow is equal to the prob¬ 
ability that a hypergeometric random variable with parameters T, 7, m takes the value m. (Recall that the hyper¬ 
geometric distribution Hyp(r, 7, m) counts the number of yellow balls drawn after m draws without replacement 
from a set of T balls, of which 7 are yellow and the remainder are blue.) This probability increases as 7 increases 
and as T decreases. Summing over m > 1 and conditioning on the event 8 gives 


P[ve^ij\8\< ^ P [HyplTo, 7o, m) = m] x P [y receives m balls] 

m>l 

f27o''” 


IsmslO 


To 


^ E hr k"‘^n"‘q+ Y. 


m>10 


To 
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Hence E 11 < n • Finally, write ‘Wij as the sum of indicator variables Xp for v e Vi, where Xy = 1 if 

V E ^ij and Xy = 0 otherwise. Azuma’s inequality [20, Theorem 2.25] implies that 

> l-exp{-n(n)}. 

Recalling P[<f] > 1 - exp{-n(n)} and taking the union bound yields Aw ^ q^~^^ with probability 1 - exp{-D(n)}. □ 
Thus Lemma 4.13 follows from Claims 4.14-4.16. 

Proof of Lemma 4.4. We know from Claim 4.12 that for all (7-complete v and all t e we have t(u] = cr(i'). 

If we let Fx be the set of x-free vertices, then by Lemma 4.13, we may assume that 

lFi\Awl<-^ + n-Og[q->^-^], IF2\Awl = n-, lAwl<n-q->^. 

We know that for any ve Fx there are at least x -i-1 choices for the color of v. Since F;i:+i c Fx it follows that 

Y^i'Sa O')] < • (^l^2uAivi 


and so 


1 ln2 Inq ~ 

-\n\^FSa,CT)\<—j—r+—Y + Oq[q ^ 
n q‘^~^ q'^ 

Further, if we set c = [q^~^ -\l2]\nq- ln2 - then 

1 In2 l.Ollnfl ~ 

-lnE[Zo,bal] =ln^7+cln(l-^7^ + - ^ + Oq{q ^). 

n q'^~^ q'^ 

Hence by Lemma 4.2, T3 holds in '^{n, k, cn) with probability 1 - o(l), completing the proof of Lemma 4.4. □ 

4.4 Proof of Corollary 3.7 

Here we assume that 

{q^~^ - \l2]\nq-\n2+ \l\nq< c< q^~^ -\l2]\nq. 

The proof of Corollary 3.7 is similar to the proof of [9, Proposition 2.1]. The starting point is the following ob¬ 
servation, which is reminiscent of the “planting trick” from ]1]. Call a : [n] ^ [q] e-halanced for some c > 0 if 
maxieiq] n!q\ <En. 

Claim 4.17. Suppose there exist e,e' > 0 and a sequence [Sn]n of events such that for large n and all e-balanced 
a\[n]^ [q] we have 


PYSa^Sn] < exp(-e'n), 
lim PYSegn] = 1- 

n—*oo 


Then there existsS > 0 such that w.h.p.ZqPS) < exp[-Sn)E[Zq['^)]. 


(4.6) 

(4.7) 







Proof. Let Z^(G) be the number of £-balanced < 7 -colorings of G. By [14, proof of Lemma 2.1] there exists a > 0 such 
that 

nZqm-Z'^m]<exp{-anmZqm]. (4.8) 

Further, let Z”{^) - e S„}. Combining Lemma 4.2 and (4.6) shows that 

ElZ'^im < exp{-E'nl2mZ'qm]. (4.9) 

Moreover, let = {Zqi'S) > expl-dnlElZ^^C)^)]} for a small enough 5 > 0. Combining (4.8) and (4.9), we obtain 

exp{-dn)ElZqm] Pe n Sn] < E[Zqmm e n S„}] 

< ElZ'^mi + E[Zqm - Z'^mi < (exp(-£'n/2) + exp(-an))E[Z^(^^)]. 

Hence, choosing d > 0 small enough and recalling (4.7), we obtain P [s^„] - o(l). □ 

Thus, we are left to exhibit a sequence of events as in Claim 4.17. Given a map t :[n] ^ [q] and a hypergraph G 
on [n] let Er{G) be the number of monochromatic edges of G under t. Further, for /I > 0 let 

Zq,p{G) = ^expf-jSErCG)), 

T 

where the sum ranges over all t : [n] ^ [q]. The function f ^ Zq^p{G) can be viewed as the partition function 
of a hypergraph variant of the “Potts antiferromagnet” from statistical physics. We consider this random variable 
because it is concentrated in the following sense. 

Claim 4.18. For any e > 0 there is6 > 0 such that for anya : [n] [q] we have 

P[\lnZq^p{^) - E\nZq^l3[‘S)\ >En] <exp{-Sn), P [|lnZ(,,^(c^o-) - ElnZ^^^CS^o.)! > £n] <exp(-dn). (4.10) 

Proof Either adding or removing a single edge alters the value of InZ^^,^ hy at most f. Therefore, the assertion 
follows from a standard application of Azuma’s inequality. □ 

Additionally, we have the following estimate of ElnZ^j 3 (')^o-)- 

Claim 4.19. There is6>0 such that for all f>0 and all S-balanced a we haueElnZqpi'^o-) > 6n-\-lnE[Zq{^)]. 
Proof. We are going to show that for a small enough d > 0 we have w.h.p. 

n-HnZqi^Sa) > q^“^ln2+ 0^(q-^). (4.11) 

Since Zq^^FSa) ^ ZqFSa) for all f and because (3.4) implies that 

n-HnElZqm] < qi-*^(ln2- (21nq)-i) + Oqlq-'^), 

the claim follows from (4.11). 

To prove (4.11), we let Pit be the set of vertices v e V) such that Vj) = 0. Further, let FF he the set of 

all V e Fij such that mk-i{v,Vh) = 0 for some he [q]\{i,j}. Due to the independence of the edges, |Fj;|,|FL| are 
binomial random variables. The expected sizes of these sets satisfy 

E|F,y| = {q-'‘-^ + dq{q-’^-^])n, E[F’^j\ = dqiq-'^-^)n. 

Hence, the Chernoff bound implies that with probability 1 - exp(-Q(n)), for all i, j we bave 

\Fij\ = iq-'‘-^ + dq{q-'‘-^))n, \F[j\^ Oq{q-^-^)n. (4.12) 
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Let F* = UiV; Pij \ P'lj- Further, for every vertex y e F* let e [q] be the (unique) color such that v e Pa[v)ai,[v)- 
Further, let F* be the set of edges e of such that there exist f, iv e e n F* such that 

a{e\{v, w}) c {a{v],a{w],(Ti,{v),ai,{w]}. 


The random variable |F* | is stochastically dominated by a binomial random variable Bin(cu, po) where 




21F 




So by (4.12), 

E|F*| = 2(q^“^+0^(q“''))^(4q“^)*^“^cu + exp(-O(u)) = Oqiq^~'') n. 
Then, the Chernoff bound, we find that with probability 1 - exp(-0(u)), 

|F*| = 6^(q-*^) n. 


(4.13) 


Now, let Fo be the set of all vertices y e F* that do not occur in any e e F*. Then by construction any map t : [u] ^ 
[q] such that T(y) e {CT(y),(7*(y)} for all y e Fq and T(y) = CT(y) for all y ^ Fq is a < 7 -coloring of '^a- Furthermore, 
there are 2'^“' such r and (4.12), (4.13) entail that [FqI > {q^~^ + Oq{q~^))nw.h.p., whence (4.11) follows. □ 


By comparison, InElZ^^^pj^)] is upper-bounded as follows. 

Claim 4.20. For any 6 > 0 there is Po>0 such that for all p> po we havelnElZq^pi'S)] < 5n + \nE[Zq{'^]]. 


Proof. Using (3.4) and the fact that monochromatic edges are least likely when t is balanced, we obtain 

— \nE[Zq{'^)\ = lnq-i-cln|l - q^~^^ + o(l), —lnE[Zq^pl'^)] <lnq=cln|l - q^~^{l - exp(-/i))j. 
Making f sufficiently large and taking logarithms, we obtain the assertion. □ 


Finally, we know from Claims 4.19-4.20 and Jensen’s inequality that there exists d > 0 such that ElnZq_^(^) + 8n< 
ElnZq_^(^cr). Flowever, Claim 4.18 implies that both lnZq_^(^) and InZ^^^C^^o-) are close to their expectations. 
Therefore Corollary 3.7 follows by applying Claim 4.17 to the event 

S„ = {G: |lnZq,/ 3 (G)-ElnZ^,^(^^)| >En}. 


5 The second moment 

In this section we prove Proposition 3.6. We keep the notation and the assumptions of Section 3 and Section 4. 

5.1 Overview 

We reduce the problem of estimating E[Z^^^g] to that of optimising the function F{a) from Lemma 3.3 over a 
certain domain ®tame- Due to the additional constraints imposed by the “tame” condition, this domain S>tame is 
a relatively small subset of Qi, which was the domain of optimisation for (3.15). In the end, F{a) will 

be seen to be significantly smaller than max^jegiF(a), and additionally, the problem of maximising F over 3>tame 
technically less demanding. 

To define S'tame formally, call aeS> separable if aij t (q“^(1.01/ q~^[l- jc)) for all (,7 e [q] (cf. (3.16)). 
Additionally, we say that a e Si is s-stable if there are precisely s pairs [i,]) such that aij > q~^ [f.Ofl We 

denote by Ss the set of aU s-stable a eS, and by S>[q-i] = Us<qSs. Geometrically, each Sg is close to a (q- si- 
dimensional face of the Birkhoff polytope, for if a has entries greater than {\.Q\l then by separability 
these entries are in fact at least {\-k)1 q (with k - ln^° qlq^~^). Finally, let @tame be the (compact) set of all aeS 
that are separable and s-stable for some 0< s< q. 
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LemmaS.l. IfmaXae&,„^^F{a) = F{d) thenE[Z^^^^g] = 0(E[Zq_h^f). 

The proof of Lemma 5.1 is by a standard application of the Laplace method. We defer the details to Section 5.2. 

In order to prove that F{a) - F{d), we observe that the set @tame naturally decomposes into a num¬ 

ber of disjoint subsets. Namely, let 3>s,tame be the set of all s-stable a e ®tame for 0 < s < k. We will argue that for 
1 < 5 < the maximum of F over @s,tame is not much greater than the function value attained at certain canonical 
points dis] with entries 

dijis) = q~^ l{i = j}l{i < 5}-i- l{i > s}l{j > s}. 

Hence, dls) is a block-diagonal matrix. The upper-left block is the s x s identity matrix, divided by q, and the 
lower-right block is the {q- s) x {q- s] matrix with all entries equal to [q[q- s))“^ Clearly, dis) e Sis,tame- 
The following statement, which we prove in Section 5.3, is the heart of the second moment analysis. 

Lemma 5.2. We have metXaeSio tame ^ r«ndmaXae@j,,„„gF(a) < F{d{s]) + q^-^^^~^for any 1< s< q. 


5.1.1 Proof of Lemma 3.6 

By Lemma 5.1 and Lemma 5.2 it remains only to show that 

F{dis)) -I- for all 1 < s < q. 

To see that this is the case, we note that 


iT(fl) = 2lnq, 


and 


21n2 .I. 

E{a) --2lnq—■—+ o[q^ *) 
q^~^ 


and thus 


(5.1) 


21n2 , 

—j^ + o[q^ ''). 
q'^ ^ 


(5.2) 


Moreover, if 5 e [q] then 


q- s 


H{a{s)) = — Inq-t- 

q q 


\n{q{q-s)], E{a{s)] <-2lnq —lnq-i-Oo(fl^ 

q 

s-2q (q-s)^ 


q-s 


Fla{s)] = — Inq-t 

q q 


and thus 


ln(^7(^7 - s)) -I- cln 1 -I- 


q'^iq-sY 


<\nq+— —^ln(q-5)-i-0(,(q^ 


(5.3) 


+ \{q^ ^ - l/2)lnq-ln2] 


s-2q {q-s) 

- — + 

q^ 


q^(q-s)^ 


\ ^ s-2q iq- s) 


= {I - s/q]ln{l- s/q] + 


21n2 sln2 5lnq Inq 
q'^ 

q^~^lnq 


2{ q'^ 
Inq 


q^{q-s)^ 


2q^ q^ 1 q^ ^{\-slq]^ ^ 


s-2q {q-sY 


q^ q^{q-sY 


+ Oq[q^ 


(5.4) 


whence (5.3) and (5.4) follow. Finally, (5.1) follows from (5.2) and (5.4). 


5.2 The Laplace method: proof of Lemma 5.1 

We seek to show that there exists some positive constant C{q) such that 

C(q)-E[Z^,bal]^ (5.5) 

The expected value of can be written as a sum over pairs of tame colourings. We will break this sum into 

several components and deal with each separately. First we estimate the contribution resulting from a near d by 
performing a Taylor expansion of F around d. 
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Lemma5.3. There exists C(q] andrjiq) such that withS - {a e 5? n S>tame -Wa- ti\\^<q{q]} tue have 

E[Z^q ,tame * ls]<C(q]-E[Zq,i,alf 

Proof. We may parametrise n ®tame as follows; disregard the [q, q) entry and consider each matrix aasaq^ -1 
dimensional vector. Let 


££:lO,llq] 


r-i. 


[0,11 q]‘’ , 


j 


if (hi) f iq, q), 


i-L 


a.mq.q) ^ij otherwise. 


We compute the Hessian oiFoT£-Ho^ + EoT£. For (/, j] f [a, b] we have 


da, 


-{Ho£^?{a)]\ =0, 

4 \ a=a 


da^j 


\a=a 


dclijdclab 


{Ho£^{a))\ =-q^. 

I a=a 


Further 


dai 




= 0 , 


^4 




2k{k-l) 


^2k-4 


daijd a^i) 




k{k- 1 ) 




2k-A 


and so 


-^{Eo££{a]]\ = 0 , 

daij I a=a 


daf 

t] 


{Eo^{a])\ 


2ck{k-l) 




d — (Z 

ck[k- 1 ) 

^2fc-4^ _ ql-k'^2 


2k-4 






dai^dUfii) 

Thus, we have that the first derivative ofPoT£ vanishes at a, and that the Hessian is 

2cA:(A:-l) 


D^[FoT£{a)]\^_^^-q^\\- 


^2(fc-l)Q _ ql-k^2 


(id+ 1 ) 


where 1 is the matrix with all all entries equal to one, and id is the identity matrix. As id is positive definite, 1 is 
positive semidefinite and c < q^~^\nq we have that the Hessian is negative definite at a. Further, it follows from 
continuity that there exists some independent of n such that the largest eigenvalue of D^[F o if] is smaller 
than for all points \\a- a\\ 2 < q. Since 5^ is linear, there exists some positive q, independent of n, such that for 
all a such that \\a - a \\2 < q we have “ ^Il 2 ^ Taylor’s theorem then implies that there is some positive 
independent of n, such that 

- 2.2 


Fo£^[a)<F{a)-^ ^ {atj-q'^) 

q.Miq.q) 


for all < 2 : II < 2 - < 2||2 < q. 

As S satisfies the conditions required for the event sd in Lemma 3.3, we may apply (3.9) with ^ - SVa obtain 


E[^^,tame•!<?] = exp{nF(a)}• ^ exp^ (ajj-q^Y 

a&S I {i,j)^{q,q] 

< exp{nF(d)}-0(n'^“'’^^'^)- f , expi-fn T {Zi, - q~^f\ dzi 

>'«'' ' I H.jmq.q) J ' 

2 r 

< exp{nF(d)} • • / exp{-^nz^}£fz 

L^/oo 


q^-l 


< C(^) -ElZg^bal] 

for some constant C{q) depending only on q. Here the final inequality follows from (3.3). 


□ 
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There are two remaining cases to consider, namely a e \S and ae3>q. We begin with the latter. 

Lemma 5.4. There exists a constant C{q) > 0 such that 

E[Zq ytame ‘ — C{q)' E[Zq i,iii] 

Proof. We calculate 

E[4tame-H]= E P[(T,T are tame] = ^ 7 ! ^ P[a ,t aietame and t e'io{‘S,cr]] 

a{a,r)E@q cr,Te@q 

- q\ ^ P[t e (T.T are tame] •P[iT,T are tame] 

cr.re®, 

a is tame] • P[cr is tame] 

< q\-E[Zq^bal] Yj E[(yisXavne\<q\-E[Zq;^3if, [byT3], 

o-e®, 

as desired. □ 

Lemma 5.5. We have that 

EiZq f^^q- — E[Zq^}jcil\ 


Proof. We take 77 as in Lemma 5.3 and set 

S' ^ la eS^n @tame :\\a-a\\2>q]. 

As S' is compact, the fact that F{a) < F{a) for all ae S' additionally implies that there exists some 7 such that 
maXa^gt Fla) < Fla) - 7 . Then 

E[-Z^,tame' ^ l^'l exp{n(F(d) - 7 )} < n"!^ exp{nlFla) - 7 )} 

< exp{n(F(fl) - 7/2)} < E[Z(;,bai] • exp{-7/3} < E[Zg,bal]^, 


as desired. 

Finally, (5.5) follows from combining Lemmas 5.3-5.5. 


□ 


5.3 The Maximisation Problem: proof of Lemma 5.2 

5.3.1 The strategy 


The proof is based on the local variation technique developed in [9]. Roughly speaking, for each 0< s < q we will 
argue that for any arbitary a eQig, we can move slightly toward a nicer matrix while increasing F. The new matrix 
that we produce is then regular enough that we may perform calculations and compare it the point als) whose first 
s diagonal entries are IIq, and whose (i, 7 )-entries are equal to Iqlq- 5 ))“^ for i,j > s. As it turns out, als) comes 
close enough to maximising F over Qig (up to a negligible error term in each case). The final step is then to show 
that these points are still bounded above by the barycentre of & (i.e. by a). 

Let us take a moment to collect some results that will be used throughout the remainder of this section. In 
particular, it may come as no surprise that in a local variations argument we make extensive use of derivatives. 
Taking partials of F we have 


dair da 


ty 


Fla) - In—^ H— 
tlix 1 ■ 


ckla^^ ^ ^) 




2lq 


fc-i. 


\a\\\’ 


i,x,y<q. 


(5.6) 
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This represents the change in F when we increase atx at the expense of aty (see Proposition 5.7, which describes 
when the above quantity is positive). Further, we wiii often tackie the changes in entropy and energy separateiy. 
Further, we need the foiiowing eiementary inequaiities (cf. [9, Coroiiary 4.10]). 

Fact 5.6. Let qa e [0, 1]4 be such cii - 1/ q, and define 

h:[ 0 , Ij^lR, -zinz -(1 - z))n(l - z). 


Then 

(i) for /c [q] and r = Zie/ haveH[a] < h(r) + r)n|/| + (1 - r)\n{q- |/|), and 

(ii) for /c { 2 ,..., q} with 0 < |/| < q - 1 and r - T.i€j ifd^i < 1 

F[{a) < h{qai) + (1 - qa\)h{rl[\- qai)) + rin|/| + (l-r - qai)\n[q - |/| - 1). 

The foiiowing iemma is the main tooi to carry out the iocai variations argument. Recaii that ,5* is the set of aii 
matrices a = (aij)ij^[q] with entries > 0 such that Zy ajj = 1 /q for aii i. 

Lemma 5.7. SupposeaeSF. Ifi e [q] and0 f /Q [q] aresuch that for some number 3lnlnq/In q < p<l wehave 


0.995 

\]\>q^ and msyLoqJ^ < \n\nql In q], 


(5.7) 


then the matrix deS’ obtained from a by setting 


dxy= l{(x,y) t{i}xj}axy + 


l{[x,y)e{i}xj} 


I/I 




is such that F (a) < F{a). In fact, the inequality is strict unless a - a. 

Proof Take i e[q], J cz [q] as described and x,yej such that 

t —1 . 0.995 . . 

=mma.... < a.-,, <-—^ (p- ininq/inqj. 




IJ 


ty 


kq'^ 


We wiii show that (5.6) is positive for the range of atx and a/y that we have at hand. It wiii be convenient to make 
the substitution 6xv - a^~^ - a^~^ > 0 and instead consider whether 

■^y ly IX 


(fc-1) 


daix da 


‘y 


F{a) = in 


aiy 

tlix 


k-i\ ck{k-l){afy^ - 

l-2lql^-^ + \\a\\l 


= in 


4 ( 


ck[k- VjSxy 
l-2/q*:-i + ||fl| 


M8xy] > 0 . 


(5.8) 


After noting that A(0) = 0, it foiiows from the concavity of A that if 5* > 0 satisfies (5.8) then so does Sxy for aii 

0.999 


0 < 8xy <8*. Therefore we take 


8*^ 


kq^ 


— (u-\n\nql\nq] > max5;cy = max a- 

■1 x,y^J r.vF? I '■ 


x,y&] I 


k-\_ k-\\ 
'-iy ^ix 


and observe that aix < jjj Hjej tiij ^ we have after taking the exponentiai of (5.8) that 
ck{k- 1)5* 


exp 


l-2/q*^-i + ||fl||| 


< exp{(A:-l)lnq(p-lnlnq/ln( 7 )} = (q^/inp)*^ ^<[\J\llnq]^ ^ 


, ,i_fc 1.991nlnq 0.995 , ^ 1 


□ 
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In other words, if we take a row i and a set / of not too few columns such that the largest entry atj, j ^ J, is not too 
hig, then the function value does not drop if we replace all entries atj, j e /, hy their average. Thus, Proposition 5.7 
can he used to “flatten” parts of the matrix a without reducing the function value. The proof of Proposition 5.7 is 
merely hy analysing signs of dF/datj -dF/daij’ for 7 ,/e /. 

In what follows we will use Lemma 5.7 to show that Lemma 5.2 holds for each 0< s < q separately. Formally, 
we set out to show that: 

Claim 5.8. Suppose that s = 0. Then for all a e we have F{a) < F{a). 

Claim 5.9. Suppose that 1 < s < Then for all a e Sig.tame we haveF{a] < Fla). 

Claim 5.10. Suppose thatq°-^^^ < s< q - Then for all a e 2>s,tame we have Fla) < Fla). 

Claim 5.11. Suppose thatq- q^*^ < s< q. Then for all a e Q^s.tame we have Fla) < Fla). 

Lemma 5.2 is immediate from Claims 5.8-5.11. 


5.3.2 Proof of Claim 5.8 


We begin with the following consequence of Lemma 5.7. 

Claim 5.12. Suppose that aE 5F has an entry aij e [1.02/(^7A;), q~^l\.Q\lk)^^^^~^^]. 
entries 

a’^y- l{xf^ i}axy+l{x- i]q~^ lx,yE [q]) 


satisfies Fla') > Fla). 


Then the matrix (f e with 


Proof. Without loss of generality we may assume that a maximises Fla) over the set a e ,5^ with respect to an e 
[1.02/(^A:), ^ 7 “^( 1 . 01 /A:)^^^*^“^']. If we apply Proposition 5.7 to the set / = [(7]\{1} with p - ln(^ 7 - VjlXnq then the 
maximality of Fla) implies that a\j = (1 - qa\i)llqlq- 1)) for j > 2. Because a' is obtained from a byreplacing the 
first row replaced by lq~^,..., q~^), the change in entropy comes to 

Hla!) - Hla) - q~^ \nq - Hlqai) > ^ 7 “^ (ln( 7 -ln 2 - (1 - 1.02/A:)ln q)> q~^ ((1.02 In q)/k-ln2). (5.9) 

Furthermore, 




l = ^''+lq-f) 


1 — CJClii 

q{q-l] 




The derivative of the function E from Lemma 3.3 satisfies 

dEla) c 


dll all 


l-2^/l *^ + ||a 


< 1 . 001 ^ 7 *^ ^\nq. 


(5.10) 


(5.11) 


Hence, (5.10) implies that Ela) - Ela') < 1.02A: q Combining this bound with (5.9) and assuming that 

q> qQ for a large enough constant qo, we find Fla') - Fla) = Ella') - Ella) + Ela') - Ela) >0. □ 


Proof of Claim 5.8. The set ®o,tame is compact. Therefore, the continuous function F attains a maximum at some 
point a E ®o,tame- Assume for contradiction that af^ a. Then we will construct a sequence of matrices a[i], i e [q], 
such that a[0] = a, a[q] - a, with Fla[i + 1]) > Fla[i]) for all i < q and ^(^[0]) Fla[q]), clearly arriving at a 
contradiction to the maximality of Fla). Specifically, let a[0] = a and obtain a[i] from a[i - 1] by letting 

Uxyli] - l{x^ i]axyli- 1 ] + l{x= i]q~^ for i,x,yE [q\. 

This construction ensures that a[q] - a. To show that F(a[/ + 1]) > Fla[i]) we consider two cases. 
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Case 1: maXyEiq] atj < l.02l[qk). We apply Lemma 5.7 with J- [q] and/i= lSincea,y < 1.02 /(^7 A;), the assumption 
(5.7) is satisfied. Consequently, F{a[i]) > F{a[i - 1]), with equality iff a[i] - a[i - 1]. 

Case 2: maxjEi^^] aij > 1.02/(qfc). Claim 5.12 shows that F[a[i]) > F{a[i - 1]). 

Finally, since a^awe have a[i] a[i - 1] for some i e [q], whence F{a) = Fla[q]) > F(fl[0]) = F{a). Note that 
although we may temporarily leave @o,tame during this process, we are guaranteed to return to d e @o,tame- □ 


5.3.3 Proof of Claim 5.9 

The strategy of this proof is to compare an arhitary element of to a matrix that is more evenly distributed (using 
Proposition 5.7), to which we then compare the barycentre of the face of ® (i.e. d(s)) and finally, to which we 
compare d. Let 1 < s < and take aeQis- h foUows from Corollary 5.12 and the separability results that we 

may assume qau > 1 - tc for i < 5 with k - In^'' q!q^~^, and further, that we may also assume qatj < 1.02/A: for all 
i ^ j < s and s< i,j< q. Let qd be the singly-stochastic matrix with entries 

f aij ifielq],j<s, 
an = 1 

[—sLe>saie if ie[q],j>s. 

Since q - s = q(l - Oq{l)) we may apply Proposition 5.7 to / = [q]\[s] for any i e [q]. It follows that F{a) < F{d). We 
will now compare F{d) and Ffafs)). To this end we must first estimate F{a). We start with the entropy term. As d is 
stochastic and qda > 1 - tc for i < s, we find that 

ri = qY, aij = 1 - qau < k, for i < s. 

Further, if we set r/ = qYfj^i a for i > s then it follows from the fact that qa is doubly-stochastic that 


5 5 

E '■i = ^ E E aij = ^7 E E aij ^ KS, for i > s. 

i>s i>sj=l i>sj=l 

We know from Fact 5.6 that 

H(qdi] < h[ri) + ri\n{q- 1) < h{K) -i-Tclnq for i < s. 


and 


H{qdi) < h[ri) + r; Ins-i- (1 - n)ln(q- 5) < h{ri) - 1 - nlns-i- ln(q- s), for i > s. 

Since h is concave, it follows that 

( K s\ 

— -I- )csln5. 

q J 

Therefore 

1 S , , q- S I Ks\ KS 

H[d) = ln (7 H— > F[{qdi) < Inq-i— (h[K) + Klnq)-\ -ln(q- s) -1- Aj — H-In5 

<Ji^i q q \ q I q 

< In^-i- — —^ln(^/- s) -I- Oq{q^~^) [as s < and hixslq) - Oq{q^~^)\ 

q 

^mqd[s)) + Oq[q^-’‘) [by (5.3)]. (5.12) 
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Next we deal with estimation of the energy term, it will he convenient to break down the problem as follows 


i^ee4+i:e4+i:e4- 

i<sj<q i>sj>s i>sj<s 


(5.13) 


As the fc-norm is maximised when summands are as unequal as possible, we have ||d, ||^ < ^ for i < s. Further, 

by the same logic we have 


and 


As s/q< q ° we know 


1 




1 >SJ>S 


^ e>s 


^ij - 

i>sj<s \i>sj<s t 


EE4^ EE 


EE4^ 

i>sj<s 




for some 7 > 0. If we combine the above results then we have shown that 

II dll^ + (q - q-r = II d(s) II^ r?-!'. 


and so 

E{a)-E(a{s)) < -^^^(lldllj^ - ||d(s)||'^) < +o^d/q)) = Oci[q^~^). (5.14) 

5II all ^ 

Therefore it follows from (5.12) and (5.14) that 


F{a)<F[a)<E{d{s)) + Oq(q^ *"). 


Recalling that siq < q “ it follows from (5.4) that 


F{a]<F{d)<F{ais)) + Oqiq^ *^) 
= (1 - s/ q] ln(l - s/q) + 


{l-s/q) ln(l-s/^) + 


21n2 sln2 slnq Inq 


Inq 


,S:-1 


q^ 


2q^ q^ 1 q^ ^{1-s/q)^ ^ 


q^ ^\nq 


s-2q [q-s] 


2 n 2 


q^{q-sY 


+ Oq{q^ 


\nq 


21n2 Inq 

qk-i gk-i q^-^[\- s!qY~'^ 


,jt-i 


21n2 Inq 


In <7 




21nq 

fc-2 + ■) 


= —(l-s/a)+ , , , , , , , , 

q q^ ^ q^ ^ q^ ^(l-s/q) 

21n2 _ s 

< (l-s/^)ln(l-s/^) +—r—- + o^(^^ Y^F{a) - s!q) + Oq{q^ *). 

q 


As the ^{\-slq) is decreasing ins, we have shown that F( a) < F(a)-1/q+1/q^ + Oq{q^ ^). This implies our original 

q d 

assertion. □ 
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5.3.4 Proof of Claim 5.10 


Let < s< q - q^'^^ and take a e Qig- As before, we may assume qaa > 1 - tc for i < s, and qatj < 1.021 k for all 
i ^ j < s and 5 < i,j < q. Let qa be the singly-stochastic matrix with entries 


aij - < 


5-1 £fe[s]\{i} ^i( 
—sY.e>saie 
7 ^(<s 


if i- j e [5], 
ifi,j<s,i^j, 
if i > s, 
if j<s< i. 


Since s,q - s > q^'^^ we may apply Proposition 5.7 to / = [(7]\[5] and /' = [s]\{/} for any i e [q]. It follows that 
F{a) < F{a]. To estimate Fia] we will now define 


ri^qY, aij = ^ atj for i < s, and rt^ qY, aij = qY ^ij 

j>s j>s j<s j<s 

and since qa is doubly stochastic, 


r = E n = E r, s y 1 - qaa < ks. 
i>s i^s i<s 


Further, we also set 


ti = q Y atj-q Y aij <f- qaii<K for i < s. (5.15) 

;e[ 5 ]\{/} je|s]\{!} 

As before we wiU now estimate both the entropy, and the energy term separately. We know from Fact 5.6 (ii) that 

H[qdi) < hiqau) + - qauiHti![f - qan)) - 1 - t;ln(s- 1) -t- (1- U - qaii)\ri{q- s) 

< hiqan) + [1-qaii)h{ri/{f-qaa)) + tjlns-i- r, ln(t 7 - s) 

= -qanlniqan) - f,ln q - r/lnr, -t t/lns-i- r/lnCt/- s) < h{ti) + f,lns-i- h{ri) + r^ln(^ 7 - s), i < s, 

where the last line follows as the function g: x - (1 - x) ln(l - x) is decreasing with g'(x) < 1 for small x. If we set 
F[-^ T-issWli) + f(ln5) then it follows from the concavity of h that 

\ ^ ^ ^ 

— Y Hiqdi) < F[-\ — h{rls) H—ln(fl- s). 

cf its q q 

Furthermore, by Fact 5.6 (i) and the concavity of h, we have 

1 a — 5 r a — s — v 

— Y H{qdi) <- h[rl{q- s)) H— Ins-t -ln((7- 5). 

^ i>s ^ q q 

Combining these results, it follows that 

Hid) <lnq + H +\—hir/ s) — ln(fl- s) I - 1 - \— - h{rl{q- s)) - 1 - (r/fl)lnsl - 1 - — -InCtr- s). 

\q q j \ q I q 

Further as /z(x) < x(l - Inx), we have 

Hid) - H<lnq-\ — [2- 21n r - 1 - 21ns-i-ln((7 - s)] - 1 - — —^Inlr/- s) 

q q 

r , . q-s 

<\nq-\ — (2-1- 31n t/J H- iniq- s) + Oqillq), [as -zlnz < 1 for z > 0] 

q q 

= 2\nq+- {2 + 3\nq) -t (1 - slq)\nil - s!q) - -t OqiHq). (5.16) 
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Since s < q, we obtain 


21^^ i; = - y(h{ti] + ti(lns-2\nq]] < - Tihiti)- tilnq] < 


(5.17) 


where the last inequality follows from noting that maXj;£[o,i] h{x) - xlnq < 1/ q. Thus, by combining (5.16) and 
(5.17) we have shown 

H{d) <2lnq+- (2 + 31nr7) + (1 - siq)\n{l-siq) - T q + Oaillq). (5.18) 

‘I ^^i<s 

Next, we move on to estimating the energy term. As before we firstly estimate \\a\\^, then we apply a bound for 
dEld Hall? in order to approximate E{d] - E{a{s]). Firstly note from (5.15) that for i < s, we have 




and 


E 4 = 


s-lj (s-D* 


Moreover, since qd is stochastic and qdii> 1 -k if i < s, we have 


y d’^-^ix/qy, for i<s. 


;€[(/]\|s] 


Combining the above equations yields 

y WdiWl < sq~^-^ yti + Oq{q^~^'‘]. 

i<s ^ i<s 

Since qaa > 1 - jc for i < s we have qaij < x for j < s< i. By construction, this implies that qdij < x for j < s < i. 
Furthermore, we have that 

sr -'k A . ^..^k 

E E ^ij ^ and 

i>sj<s H i>sj>s 

We have shown that 


yydij^iq-sf[^^\ <q-\q-sr^ 


||d||^<sr? ^+q ’‘{q-sf ^-^yq + Oqiq^ '^^)^\\d{s)\\\- ^yti +Oq(q^ 

Q i<s Q i<s 


and so from (5.11) and (5.3), we have 


E{d) = Eidis]) + • (II dll- II d(5) II) 

dllall^ 


<E{d{s])-q'^ ^\nq{l + Oqillq])- —ryti + Oq(q^ 




<E[d{s))-^^^yti + Oq{q^ ^) 

^ i<s 

5 fclnfl 

= -21n^7 ^—\nq -^ t; + Oq{\lq). 

^ ^ i^s 


(5.19) 


Finally then, it follows from (5.18) and (5.19) that 


r , , (2 - A;) In q . 5 -^ 

F{a) < Fid) < — (2 + 31nq) + (1 - 5/q)ln(l - siq]-\ -^ q + Oqillq) 

^ ^ iss 


= (1 - s/q)ln(l -slq] + Oqillq) < —(1 - 5/q) + Oa(l/q). 

q 


Fortunately, our assumption < s< q - ensures that Fia) <0 < Eia). 


0.49 , 


□ 
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5.3.5 Proof of Claim 5.11 


Let q - y/q < s < q-l and take a e 2>s. As before we may assume qaa > 1 - jc for i e [s], and quij < 1.02/A: for all 
i ^ j < s and s < i,j < q. Let r, = ^ij- As qa is doubly-stochastic and qaa > 1 - )c for i < s, we have 

r = ^ ^ ^ ^ ^ 1 - qaa < ks. 

i<s i<s 


Further, we let 


^ qO-i]! 

j>s 


and 

i<s 


Since qa is doubly-stochastic we have 


f — ^ ^ qaij — ^ ^ qo-ij- 

issj>s i>s]^s 


(5.20) 


The strategy of this proof is to compare F{a] to F{q Hd) where id is the qx q identity matrix. To this end we firstly 
estimate the entropy of a. We now set H = ^T,i<s hiqaa) and as before apply Fact 5.6 (ii) and the concavity of h to 
observe that 


- YHiqai) < - y h[qaii) + riHtiIrt] + ti\n{q-s) + (r,- - f;)lns 

^ i<s ^ i<s 

— r t r -1 

<H-\ — h{t/r)-\ — ln{q- s) H-ins 

q q q 

— t t r — t 

<H -\—(1 - In f -I- Inr) H— ln{q- s) H-ins [as h[z] < z(l - Inz)]. 

q q q 

As -zlnz < 1 for z > 0, we have that - tin t < 1. Furthermore, as qa is doubly-stochastic we have that t< q- s, and 
so 


-^(1 - In f-i-lnr) < ——^ ■ (1 -t 0(1/fl)]. 
q q 


Therefore 


- YHiqai) < H + - ln[q- s) + - —^Ins-t ^-^(l-t 0^(1/^)). (5.21) 

q q q q 


We now move to estimating Fl(qai) for i > s. As is by now routine, we apply Fact 5.6 (i) along with the concavity of 
h and (5.20) to conclude that 


1 ^ 1 ^ 

- y Hiqat) < - y 

^ i>s ^ i>s 


Yqatj +Yqaijln{s]+ 1 -y q o-ij 

\i<s j<s ]<s 

t \ t q-s-t 

H— In s H-ln(u - s) 

q q 


\n{q- s) 


q-s ^ 

< h 

q \q- s 

q-s, t, q-s- f. 

< -ln2-i—InsH-ln(< 7 - s) 

q q q 


[as h{z) < ln2 for all z]. 


Finally then, we have from (5.21) and (5.22) that 


H{a) - \nq + H ^— y Fliqat) 

^ i<q 

< Int/H— Ins-I- — -ln2 -i- — -ln(^ 7 - s) -i- — -(l -i- 0(1/< 7 )) 

q q q q 


(5.22) 
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[as q-s< v^. 


<lnt 7 +H+—ln5 + — —^ln 2 + — ln (7 + ——+ 0 ( 1 /< 7 )), 

Moving on to the energy term, we firstly estimate || a|| As the norm is maximised when the summands are widely 
distributed, we have 


^ II ai ^ af; + 

i^s i<s /<s 


A similar argument then applies to the remaining q - s rows. Recalling Corollary 5.12, it follows that 

,fc 


^||a;||'^< (r?-s) 


1.02 

qk 


Therefore, 




and so 




Finally then, we have from (5.11) that 
E{a)-E{q~hd} 


1.02 


-1 


+ ^Y.[{qaii)’‘-l] + oa/q’'^'^). 

Q i<s 


- 1 -iifc II 


da 


■(ll«llfc-|k idllfc) 


<\nq- 


q-s 

1 


1.02 


-1 


+ — y liqaii)'^-!]]+ Oq (InqIq) 
^ i^s 


If we combine our estimates for the entropy and energy terms, we have shown that 


F[a)-F(q Hd) < Hh— InSH- — V [[qauy -\]) + On(\nqlq] 

+ lnq 


q- s 


1.02)'' 2 + ln2 


+ ■ 


ln^7 


■- 1/2 


Since maxo<^<i h(z) - zlnq< \l q and h{z] - h(l - z), if we set pa = 1 - qaa then 

H+ ^lns+^^ yKqauy -1]) ^ E [^(Pn) + P//ln^7+ ((1 - pu)^ - ijlnr/] 


^ !<S 


^ !<S 


- “ H [^(Pn) + Pn ln^-fcp;;ln^] + 0(1/^) < - ^ [Hpu) - painq] + 0(11 q) = 0(1/^). 

^7 i<s ^ i<s 


Finally then 


F(a)<F(q ^id)-lnql3q + Oqilnq I q] < F(q ^id]-—F(a]. 


□ 


References 

[1] D. Achlioptas, A. Coja-Oghlan: Algorithmic barriers from phase transitions. Proc. 49th FOCS (2008) 793-802. 


29 
















[2] D. Achlioptas, C. Moore: Random A:-SAT: two moments suffice to cross a sharp threshold. SIAM Journal on 
Computing 36 (2006) 740-762. 

[3] D. Achlioptas, C. Moore: On the 2-colorability of random hypergraphs. Proc. 6th RANDOM (2002) 78-90. 

[4] D. Achlioptas, A. Naor: The two possible values of the chromatic number of a random graph. Annals of Math¬ 
ematics 162 (2005), 1333-1349. 

[5] D. Achlioptas, A. Naor, Y. Peres: Rigorous location of phase transitions in hard optimization problems. Nature 
435 (2005) 759-764. 

[6] N. Alon, M. Krivelevich: The concentration of the chromatic number of random graphs. Combinatorica 17 
(1997) 303-313 

[7] B. Bollobas: The chromatic number of random graphs. Combinatorica 8 (1988) 49-55 

[8] A. Coja-Oghlan: Upper-bounding the fc-colorability threshold by counting covers. Electronic Journal of Com¬ 
binatorics 20 (2013) P32. 

[9] A. Coja-Oghlan, D. Vilenchik: Chasing the fc-colorability threshold. Proc. 54th FOCS (2013) 380-389. 

[10] A. Coja-Oghlan, L. Zdeborova: The condensation transition in random hypergraph 2-coloring. Proc. 23rd 
SODA (2012) 241-250. 

[11] A. Coja-Oghlan, K. Panagiotou: Catching the fc-NAESAT threshold. Proc. 44th STOC (2012) 899-908. 

[12] J. Ding, A. Sly, N. Sun: Proof of the satisfiability conjecture for large fc. Proc. 47th STOC (2015) 59-68. 

[13] O. Dubois, ]. Mandler: The 3-XORSAT threshold. Proc. 43rd FOCS (2002) 769-778. 

[14] M. Dyer, A. Frieze, C. Greenhill: On the chromatic number of a random hypergraph. Journal of Combinatorial 
Theory, Series B. 113 (2015), 68-122 

[15] P. Erdos, A. Renyi: On the evolution of random graphs. Magayar Tud. Akad. Mat. Kutato Int. Kozl. 5 (1960) 
17-61. 

[16] A. Goerdt, L. Falke: Satisfiability Thresholds beyond fc-XORSAT. CSR (2012) 148-159. 

[17] A. Frieze, N. Wormald: Random fc-Sat: a tight threshold for moderately growing fc. Combinatorica 25 (2005) 
297-305. 

[18] C. Greenhill, S. Janson, A. Rucihski: On the number of perfect matchings in random lifts. Combinatorics, 
Probability and Computing, 19 (2010), pp. 791-817. 

[19] H. Hatami, M. Molloy: Sharp thresholds for constraint satisfaction problems and homomorphisms. Random 
Structures and Algorithms 33 (2008) 310-332. 

[20] S. Janson, T. Luczak, A. Rucihski: Random Graphs, Wiley, New York, 2000. 

[21] W. Kauzmann: The nature of the glassy state and the behavior of liquids at low temperatures. Chem. Rev. 43 
(1948) 219-256. 

[22] M. Krivelevich, B. Sudakov: The chromatic numbers of random hypergraphs. Random Structures and Algo¬ 
rithms 12 (1998) 381-403. 

[23] F. Krzakala, A. Montanari, F. Ricci-Tersenghi, G. Semerjian, L. Zdeborova: Gibbs states and the set of solutions 
of random constraint satisfaction problems. Proc. National Academy of Sciences 104 (2007) 10318-10323. 

[24] A. Kupavskii, D. Shabanov: Golorings of partial Steiner systems and their applications. Journal of Mathemati¬ 
cal Sciences 206 (2015) 511-538. 


30 


[25] T. Luczak: The chromatic number of random graphs. Combinatorica 11 (1991) 45-54 


[26] T. Luczak: A note on the sharp concentration of the chromatic number of random graphs. Combinatorica 11 
(1991) 295-297 

[27] D. Matula: Expose-and-merge exploration and the chromatic number of a random graph. Combinatorica 7 
(1987) 275-284. 

[28] B. Pittel, G. Sorkin: The Satisfiability Threshold for k-XORSAT. Random Structures & Algorithms 25 (2016) 
236-268. 

[29] E. Shamir, J. Spencer: Sharp concentration of the chromatic number of random graphs G{n, p). Combinator¬ 
ica? (1987) 121-129 


31 


